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[Name of Document] SPECIFICATION 
[Title of the Invention] 

Sound Signal Analyzing Device, Sound Signal Analyzing Method and Storage Medium 
[Patent Claims] 

[Claim 1] A sound signal analyzing device comprising: 

input means for inputting a desired sound signal; 

characteristic extraction means for extracting a characteristic of the sound signal input 
from said input means; and 

setting means for setting various parameters for use in analysis of the sound signal, in 
accordance with the characteristic of the sound signal extracted by said characteristic extraction 
means. 

[Claim 2] A sound signal analyzing device as recited in claim 1 wherein said characteristic 
extraction means extracts a volume level of the sound signal as said characteristic, and said 
setting means sets a threshold value for use in the analysis of the sound signal, in accordance 
with the volume level extracted by said characteristic extraction means. 

[Claim 3] A sound signal analyzing device as recited in claim 1 wherein said characteristic 
extraction means extracts upper and lower pitch limits of the sound signal as said characteristic, 
and said setting means sets a filter characteristic for use in the analysis of the sound signal, in 
accordance with the upper and lower pitch limits extracted by said characteristic extraction 
means. 

[Claim 4] A sound signal analyzing device comprising: 
input means for inputting a desired sound signal; 

pitch extraction means for extracting a pitch of the sound signal input from said input 
means; and 

scale allocation means for allocating the pitch of the sound signal extracted by said pitch 
extraction means to any of scale notes, on the basis of a frequency of the extracted pitch, said 
scale allocation means having a plurality of conditions for allocation, the respective conditions 
being set so that frequency bands of a pitch to be allocated to each scale notes are different each 
other while the respective conditions are used for assigning to the same scale notes. 
[Claim 5] A sound signal analyzing method comprising: 

a step of inputting a desired sound signal to be analyzed; 

a step of extracting a characteristic of the sound signal from the input sound signal; and 
a step of setting various parameters for use in analysis of the sound signal, in accordance 

with the characteristic of the extracted sound signal. 

[Claim 6] A sound signal analyzing method as recited in claim 5 wherein said step of 

extracting extracts a volume level of the sound signal as said characteristic, and said step of 

setting sets a threshold value for use in the analysis of the sound signal, in accordance with the 

extracted volume level. 

[Claim 7] A sound signal analyzing method as recited in claim 5 wherein said step of 
extracting extracts upper and lower pitch limits of the sound signal as said characteristic, and 
said step of setting sets a filter characteristic for use in the analysis of the sound signal, in 
accordance with the extracted upper and lower pitch limits. 

[Claim 8] A sound signal analyzing method comprising: 
a step of inputting a desired sound signal to be analyzed; 

a step of extracting a pitch of the sound signal from the input sound signal; and 

a step of allocating the pitch of the extracted sound signal to any of scale notes, on the 



basis of a frequency of the extracted pitch, said step of allocating having a plurality of conditions 
for allocation, the respective conditions being set so that frequency bands of a pitch to be 
allocated to each scale notes are different each other while the respective conditions are used for 
allocating to the same scale notes. 

[Claim 9] A machine-readable medium containing a group of instructions of a sound signal 
analyzing program for execution by a computer, said sound signal analyzing program comprising 
the steps of: 

a step of inputting a desired sound signal to be analyzed; 

a step of extracting a characteristic of the sound signal from the input sound signal; and 
a step of setting various parameters for use in analysis of the sound signal, in accordance 

with the characteristic of the extracted sound signal. 

[Claim 10] A machine-readable medium containing a group of instructions of a sound signal 

analyzing program for execution by a computer, said sound signal analyzing program comprising 

the steps of: 

a step of inputting a desired sound signal to be analyzed; 

a step of extracting a pitch of the sound signal from the input sound signal; and 
a step of allocating the pitch of the extracted sound signal to any of scale notes, on the 
basis of a frequency of the extracted pitch, said step of allocating having a plurality of conditions 
for allocation, the respective conditions being set so that frequency bands of a pitch to be 
allocated to each scale notes are different each other while the respective conditions are used for 
allocating to the same scale notes. 

[Detailed Description of the Invention] 
[0001] 

[Technical Field of the Invention] 

The present invention relates generally to sound signal analyzing devices and methods for 
creating a MIDI file or the like on the basis of input sounds from a microphone or the like, and 
more particularly to an improved sound signal analyzing device and method which can 
effectively optimize various parameters for use in sound signal analysis. 
[0002] 
[Prior Art] 

Examples of the conventional sound signal analyzing devices include one in which 
detected volume levels and highest and lowest pitch limits, etc. of input vocal sounds have been 
set as parameters for use in subsequent analysis of sound signals. These parameters are 
normally set in advance on the basis of vocal sounds produced by ordinary users and can be 
varied as necessary by the users themselves when the parameters are to be put to actual use. 
[0003] 

[Problems to Be Solved by the Invention] 

However, because the input sound levels tend to be influenced considerably by the 
operating performance of hardware components used and various ambient conditions, such as 
noise level, during sound input operations, there arises a need to review the level settings from 
time to time. Further, the upper and lower pitch limits would influence pitch-detecting filter 
characteristics during the sound signal analysis, and thus it is undesirable to immoderately 
increase a difference or width between the upper and lower pitch limits. Unduly increasing the 
width between the upper and lower pitch limits is undesirable in that it would result in a wrong 
pitch being detected due to harmonics and the like of the input sound. In addition, because the 



conventional sound signal analyzing devices require very complicated and sophisticated 
algorithm processing to deal with the pitch detection over a wide pitch range, the processing 
could not be readily carried out in real time. Moreover, even for some of the parameters 
appropriately modifiable by the users, it is necessary for the users to have a certain degree of 
musical knowledge, and therefore it is not desirable for the users to have freedom in changing 
the parameters. However, because some of the users may produce vocal sounds of a unique 
pitch range far wider than those produced by ordinary users or of extraordinary high or low 
pitches, it is very important that the parameters should be capable of being modified as necessary 
in accordance with the unique tendency and characteristics of the individual users. 
[0004] 

It is therefore an object of the present invention to provide a sound signal analyzing device, 
a sound signal analyzing method and a storage medium, which can modify various parameters 
for use in the sound signal analysis in accordance with types of the parameters and 
characteristics of a user's vocal sound. 
[0005] 

[Means for Solving the Problems] 

A sound signal analyzing device according to the present invention as recited in claim 1 
at the time of filing the application, which comprises: input means for inputting a desired sound 
signal; characteristic extraction means for extracting a characteristic of the sound signal input 
from said input means; and setting means for setting various parameters for use in analysis of the 
sound signal, in accordance with the characteristic of the sound signal extracted by said 
characteristic extraction means. Because of the arrangement that a characteristic of the input 
sound signal is extracted via the extraction means, even when the input sound signal variously 
differs depending on its sound characteristic (such as a user's singing ability), various parameters 
can be appropriately altered in accordance with the difference in the extracted characteristic of 
the sound signal, which thereby greatly facilitates setting of the necessary parameters. 
[0006] 

A sound signal analyzing device according to the present invention as recited in claim 2 at 
the time of filing the application is an embodying mode of the sound signal analyzing device 
according to claim 1, wherein said characteristic extraction means extracts a volume level of the 
sound signal as said characteristic, and said setting means sets a threshold value for use in the 
analysis of the sound signal, in accordance with the volume level extracted by said characteristic 
extraction means. Thus, by setting an appropriate threshold value for use in the sound signal 
analysis, a start point of sounding, i.e., key-on timing, is extracted from individual users' vocal 
sound characteristics because the start point of sounding is different between different users. 
[0007] 

A sound signal analyzing device according to the present invention as recited in claim 3 at 
the time of filing the application is an embodying mode of the sound signal analyzing device 
according to claim 1, wherein said characteristic extraction means extracts upper and lower pitch 
limits of the sound signal as said characteristic, and said setting means sets a filter characteristic 
for use in the analysis of the sound signal, in accordance with the upper and lower pitch limits 
extracted by said characteristic extraction means. By setting the filter characteristic for the 
sound signal analysis to within an appropriate range, it is possible to effectively avoid the 
inconvenience that a harmonic pitch is detected erroneously or a pitch to be detected can not be 
detected at all. 
[0008] 



A sound signal analyzing device according to the present invention as recited in claim 4 at 
the time of filing the application, which comprises: input means for inputting a desired sound 
signal; pitch extraction means for extracting a pitch of the sound signal input from said input 
means; and scale allocation means for allocating the pitch of the sound signal extracted by said 
pitch extraction means to any of scale notes, on the basis of a frequency of the extracted pitch, 
said scale allocation means having a plurality of conditions for allocation, the respective 
conditions being set so that frequency bands of a pitch to be allocated to each scale notes are 
different each other while the respective conditions are used for allocating to the same scale 
notes. Conventional scale allocation means allocates a pitch of a sound signal to such a scale 
note among scale notes of a particular scale, on the basis of a frequency thereof, that is seemed to 
have a frequency most close to the frequency of the pitch of the sound signal. For example, in 
such case that scale notes of a scale to be allocated correspond to white keys, despite that a user 
intended to sound a tone corresponding to a black key, an allocation of the tone is made to a scale 
note of any white key. On the other hand, in such case that a scale to be allocated is a 12-tone 
scale corresponding to either white keys or black keys, because the tone sounded by the user is 
automatically allocated to a scale note most close to the tone in frequency, despite that the user 
intended to sound a tone corresponding to a white key, there is a possibility that the tone sounded 
by the user is allocated to a pitch of a black key due to difference between pitches of user's sound 
and the intended white key. It is therefore, according to the present invention, while an 
allocation to the 12-tone scale is performed, a frequency range for allocating to a black key is set 
to a range narrower than a frequency range for a white key as possible, and it is configured that if 
the frequency of the tone sounded by the user corresponds to the narrow range for a black key, 
the tone is allocated to the black key. In this way, it is possible to perform an allocation process 
that the tone sounded by the user is allocated to an appropriate scale note depending to a user's 
singing ability. 
[0009] 

Further, the present invention may be implemented not only as a sound signal analyzing 
device as mentioned above but also as a sound signal analyzing method as defined in any of 
claims 5 - 8 at the time of filing the application. The present invention may also be practiced as 
a computer program and a storage medium storing such a computer program as defined in any of 
claims 9 and 10 at the time of filing the application. 
[0010] 

[Embodiments of the Invention] 

Preferred embodiments according to the present invention will be described in greater 
detail hereinbelow with reference to the accompanying drawings. 
[0011] 

Fig. 2 is a block diagram illustrating a general hardware setup of a personal computer that 
functions as a sound signal analyzing device in accordance with an embodiment of the present 
invention. This personal computer is controlled by a CPU 21, to which are connected, via a 
data and address bus 2P, carious components, such as a program memory (ROM) 22, a working 
memory 23, an external storage device 24, a mouse operation detecting circuit 25, a 
communication interface 27, a MIDI interface 2A, a mouse interface 2D, a keyboard (K/B) 
operation detecting circuit 2F, a display circuit 2H, a tone generator circuit 2J and an effect 
circuit 2K. While the personal computer may include other hardware components, the personal 
computer according to this embodiment will be described below as only including these 
hardware resources essential for implementing various features of the present invention. 



[0012] 

The CPU 21 carries out various processes based on various programs and data stored in 
the program memory 22 and working memory 23 as well as musical composition information 
received from the external storage device 24. In this embodiment, the external storage device 
24 may comprise any of a floppy disk drive, hard disk drive, CD-ROM drive, magneto-optical 
disk (MO) drive, ZIP drive, PD drive and DVD drive. Composition information and the like 
may be received from another MIDI instrument 2B or the like external to the personal computer, 
via the MIDI interface 2A. The CPU 21 supplies the tone generator circuit 2J with the 
composition information received from the external storage device 24, to audibly reproduce or 
sound the composition information through an external sound system 2L. 
[0013] 

The program memory 22 is a ROM having prestored therein various programs including 
system-related programs and operating programs as well as various parameters and data. The 
working memory 23 is provided for temporarily storing data generated as the CPU 21 executes 
the programs, and it is allocated in predetermined address regions of a random access memory 
(RAM) and used as registers, flags, buffers, etc. Some or all of the operating programs and 
various data may be prestored in the external storage device 24 such as the CD-ROM drive rather 
than in the program memory or ROM 22 and may be transferred into the working memory or 
RAM 23 or the like for storage therein. This arrangement greatly facilitates installment and 
version-up of the operating programs etc. 
[0014] 

Further, the personal computer of Fig. 2 may be connected via the communication 
interface 27 to a communication network 28, such as a LAN (Local Area Network), the Internet 
or telephone line network, to exchange data (e.g., composition information with associated data) 
with a desired sever computer. Thus, in a situation where the operating programs and various 
data are not contained in the personal computer, these operating programs and data can be 
downloaded from the server computer to the personal computer. Specifically, in such a case, 
the personal computer, which is a "client", sends a command to request the server computer 29 to 
download the operating programs and various data by way of the communication interface 27 
and communication network 28. In response to the command, the server computer 29 delivers 
the requested operating programs and data to the personal computer via the communication 
network 28. Then, the personal computer receives the operating programs and data via the 
communication interface 27 and stores them into the RAM 23 or the like. In this way, the 
necessary downloading of the operating programs and various data is completed. 
[0015] 

It will be appreciated that the present invention may be implemented by a 
commercially-available electronic musical instrument or the like having installed therein the 
operating programs and various data necessary for practicing the present invention, in which case 
the operating programs and various data may be stored on a recording medium, such as a 
CD-ROM or floppy disk, readable by the electronic musical instrument and supplied to users in 
the thus-stored form. 
[0016] 

Mouse 26 functions as a pointing device of the personal computer, and the mouse 
operation detecting circuit 25 converts each input signal from the mouse 26 into position 
information and sends the converted position information to the data and address bus 2P. 
Microphone 2C picks up a human vocal sound or musical instrument tone to convert it into an 



analog voltage signal and sends the converted voltage signal to the microphone interface 2D. 
The microphone interface 2D converts the analog voltage signal from the microphone 2C into a 
digital signal and supplies the converted digital signal to the CPU 21 by way of the data and 
address bus 2R Keyboard 2E includes a plurality of keys and function keys for entry of desired 
information such as characters, as well as key switches corresponding to these keys. The 
keyboard operation detecting circuit 2F includes key switch circuitry provided in corresponding 
relation to the individual keys and outputs a key event signal corresponding to a depressed key. 
In addition to such hardware switches, various software-based button switches may be visually 
shown on a display 2G so that any of the button switches can be selectively operated by a user or 
human operator through software processing using the mouse 26. The display circuit 2H 
controls displayed contents on the display 2D that may include a liquid crystal display (LCD) 
panel. 
[0017] 

The tone generator circuit 2J, which is capable of simultaneously generating tone signals 
in a plurality of channels, receives composition information (MIDI files) supplied via the data 
and address bus 2P and MIDI interface 2A and generates tone signals on the basis of the received 
information. The tone generation channels to simultaneously generate a plurality of tone 
signals in the tone generator circuit 2J may be implemented by using a single circuit on a 
time-divisional basis or by providing separate circuits for the individual channels on a one-to-one 
basis. Further, any tone signal generation method may be used in the tone generator circuit 2J 
depending on an application intended. Each tone signal generated by the tone generator circuit 
2J is audibly reproduced or sounded through the sound system 2L including an amplifier and 
speaker. The effect circuit 2 is provided, between the tone generator circuit 2J and the sound 
system 2L, for imparting various effects to the generated tone signals; alternatively, the tone 
generator circuit 23 may itself contain such an effect circuit. Timer 2N generates tempo clock 
pulses for counting a designated time interval or setting a desired performance tempo to 
reproduce recorded composition information, and the frequency of the performance tempo clock 
pulses is adjustable by a tempo switch (not shown). Each of the performance tempo clock 
pulses generated by the timer 2N is given to the CPU 21 as an interrupt instruction, in response 
to which the CPU 21 interruptively carries out various operations during an automatic 
performance. 
[0018] 

Now, with reference to Figs. 1 and 3 to 10, a detailed description will be made about 
exemplary behavior of the personal computer of Fig. 2 when it functions as the sound signal 
analyzing device. Fig. 1 is a flow chart of a main routine executed by the CPU 21 of the 
personal computer functioning as the sound signal analyzing device. 
[0019] 

At first step of the main routine, a predetermined initialization process is executed, where 
predetermined initial values are set in various registers and flags within the working memory 23. 
As a result of this initialization process, a parameter setting screen 70 is shown on the display 2G 
as illustrated in Fig. 7. The parameter setting screen 70 includes three principal regions: a 
recording/reproduction region 71; a rounding setting region 72; and a user setting region 73. 
[0020] 

The recording/reproduction region 71 includes a recording button 71 A, a MIDI 
reproduction button 71B and a sound reproduction button 71C. Activating or operating a 
desired one of the buttons starts a predetermined process corresponding to the operated button. 



Specifically, once the recording button 71A is operated, user's vocal sounds picked up by the 
microphone 2C are sequentially recorded into the sound signal analyzing device. Each of the 
thus-recorded sounds is then analyzed by the sound signal analyzing device to create a MIDI file. 
Basic behavior of the sound signal analyzing device is described in detail in Japanese Patent 
Application No. HEI-9-336328 filed earlier by the assignee of the present application, and hence 
a detailed description of the device behavior is omitted here. Once the MIDI reproduction 
button 71B is operated, the MIDI file created by the analyzing device is subjected to a 
reproduction process. It should be obvious that any existing MIDI file received from an 
external source can also be reproduced here. Further, once the sound reproduction button 71 C 
is operated, a live sound file recorded previously by operation of the recording button 71A is 
reproduced. Note that any existing sound file received from an external source can of course be 
reproduced in a similar manner. 
[0021] 

The rounding setting region 72 includes a 12-tone scale designating button 72A, an 
intermediate scale designating button 72B and a key scale designating button 72C, which are 
operable by the user to designate a desired scale rounding condition. In response to operation 
of the 12-tone scale designating button 72 A by the user, analyzed pitches are allocated, as a scale 
rounding condition for creating a MIDI file from a recorded sound file, to the notes of the 
12-tone scale (whole tone scale). In response to operation of the key note scale designating 
button 72C, pitches of input sounds are allocated, as the rounding condition, to the notes of a 
designated musical key. If the designated key scale is C major, the input sound pitches are 
allocated to the notes corresponding to the white keys. Note that, if reference pitches 
correspond to black keys, the respective pitches are shifted so as to correspond to the designated 
key scale. Further, in response to operation of the intermediate scale designating button 72B, a 
rounding process corresponding to the key scale is, in principle, carried out, in which, only when 
the analyzed result shows that the pitch corresponds to a note pitch other than key scale notes, 
the pitch is judged to be as the note pitch other than key scale notes. 
[0022] 

Fig. 8 conceptually shows different rounding conditions. More specifically, Figs. 8(A), 
8(B) and 8(C) are diagrams showing concepts of scale rounding conditions corresponding to the 
12-tone scale designation, intermediate scale designation and key scale designation. In Figs. 
8(A) to 8(C), the direction in which the keyboard keys are arranged (i.e., the horizontal direction) 
represents a sound pitch, i.e., sound frequency determined as a result of the sound signal analysis. 
Thus, for the 12-tone scale designation of Fig. 8(A), a boundary is set centrally between pitches 
of every adjacent scale notes, and the sound frequencies determined as a result of the sound 
signal analysis are allocated to all of the 12 scale notes. For the key scale designation of Fig. 
8(C), each sound frequency determined as a result of the sound signal analysis is allocated to any 
one of the scale notes using, as boundaries, the frequencies of the black-key-corresponding notes 
(C#, D#, F#, G# and A#). For the intermediate scale designation of Fig. 8(B), however, the 
frequency ranges allocated to the black-key-corresponding notes (C#, D#, F#, G# and A#) are set 
to be narrower than those set for the 12-tone scale designation of Fig. 8(A,) although the 
frequency allocation is similar, in principle, to that for the 12-tone scale designation of Fig. 8(A). 
More specifically, while the boundary is set centrally between pitches of every adjacent scale 
notes in the example of Fig. 8(A), the frequency range between the black-key-corresponding 
notes in the example of Fig. 8B is narrower than the example of Fig. 8(A) by about 1/3. Note 
that the frequency ranges may be set to any suitable values, and they may be enough to be 



8 



narrower than the example of Fig. 8(A) and the degrees thereof are entirely suitable. The 
reason why the black-key-corresponding notes (C#, D#, F#, G# and A#), i.e., non-diatonic scale 
notes, —denoted below the intermediate scale designating button 72B in Fig. 7 for illustration of 
scale allocation states— are each shown in an oval shape is that they correspond to the narrower 
frequency determining ranges. 
[0023] 

The rounding setting region 72 also includes a non-quantizing button 72D, a two-part 
dividing button 72E, a three-part dividing button 72F and a four-part dividing button 72G, which 
are operable by the user to designate a desired measure-dividing condition for the sound signal 
analysis. Once any one of these buttons 72D to 72G is operated by the user, the sound file is 
analyzed depending on a specific number of divided measure segments (i.e., measure divisions) 
designated via the operated button, to thereby create a MIDI file. To the right of the buttons 
72D to 72G, indicators of measure dividing conditions corresponding thereto are also visually 
displayed in instantly recognizable form. Namely, the indicator to the right of the 
non-quantizing button 72D shows that the start point of the sound duration is set optionally in 
accordance with an analyzed result of the sound file with no quantization. The indicator to the 
right of the two-part dividing button 72E shows that the start of the sound duration is set at a 
point corresponding to the length of an eighth note obtained, as a minimum unit note length, by 
halving one beat (quarter note). Similarly, the indicator to the right of the three-part dividing 
button 72F shows that the start of the sound duration is set at a point corresponding to the length 
of a triplet obtained by dividing one beat into three equal parts, and the indicator to the right of 
the four-part dividing button 72G shows that the start of the sound duration is set at a point 
corresponding to the length of a 16th note obtained, as a minimum unit note length, by dividing 
one beat into four equal parts. The number of the measure divisions mentioned above is just 
illustrative and any number may be selected optionally. 
[0024] 

Further, the user setting region 73 includes a level setting button 73A and a sound pitch 
range setting button 73B, activation of which causes a corresponding process to start. Namely, 
once the level setting button 73A is operated by the user, a level check screen is displayed as 
exemplarily shown in Fig. 9. This level check screen includes: a level meter area 91 using 
colored illumination to indicate a current sound volume level on a real-time basis; a level pointer 
92 moving vertically or in a direction transverse to the level meter calibrations as the sound 
volume level rises or falls; a sign 93 indicating that the level pointer 92 corresponds to a level 
indicating window 94 showing a currently-designated sound volume level in a numerical value; a 
confirming button ("OK" button) 95 for confirming the designated sound volume level; and a 
"cancel" button 96 for cancelling a level check process. Any desired numerical value can be 
entered into the level indicating window 94 directly via the keyboard 2E of Fig. 2. The user's 
vocal sound is analyzed in accordance with the sound volume level set via the level check 
screen. 
[0025] 

Once the sound pitch range setting button 73B is operated by the user, a pitch check screen 
is displayed as exemplarily shown in Fig. 10. This pitch check screen includes a first pointer 
101 for indicating an upper pitch limit in a currently-set sound pitch range, a second pointer 102 
for indicating an lower pitch limit in the currently-set sound pitch range, and a third pointer 109 
for indicating a pitch of a vocal sound currently input from the user, which together function to 
show which region of the keyboard 2E the currently-set sound pitch range corresponds to. The 



keyboard region in question may be displayed in a particular color different from that of the 
remaining region of the keyboard, in addition to or in place of using the first and second pointers 
101 and 102. The pitch check screen also includes a sign 103 indicating that the first pointer 
101 corresponds to a numerical value displayed by an upper pitch limit indicating window 105 
located adjacent to the sign 103, and a sign 104 indicating that the second pointer 102 
corresponds to a numerical value displayed by a lower pitch limit indicating window 106 located 
adjacent to the sign 104. Any desired numerical values can be entered into the upper and lower 
pitch limit indicating windows 105 and 106 directly via the keyboard 2E. The pitch check 
screen further includes a confirming or "OK" button 107 and a "cancel" button 108 similarly to 
the above-mentioned level check screen. The user's vocal sound is analyzed in accordance with 
the sound pitch range set via the pitch check screen. 
[0026] 

With the parameter setting screen 70 displayed in the above-mentioned manner, the user 
can set various parameters by manipulating the mouse 2C. The main routine of Fig. 1 executes 
various determinations corresponding to the user's manipulation of the mouse 2C. Namely, it is 
first determined whether or not the sound pitch range setting button 73B has been operated by 
the user, and if an affirmative (YES) determination is made, the routine carries out a sound pitch 
range setting process as shown in Fig. 3. In this sound pitch range setting process, a 
predetermined dialog screen is displayed, and detection is made of a pitch of a vocal sound input 
via the microphone 2C. Then, a user-designated sound pitch range is set as by changing the 
color of the keyboard region corresponding to the detected sound pitch and also changing the 
positions of the first and second pointers 101 and 102 on the dialog screen of Fig. 10. Such a 
series of sound pitch setting operations is repeated until the confirming (OK) button 107 is 
operated. Then, once the confirming (OK) button 107 is operated, a pitch-extracting band-pass 
filter coefficient is set in accordance with the keyboard region between the upper and lower pitch 
limits currently displayed on the dialog screen at the time point when the confirming (OK) 
button 107 is operated. In this way, the sound pitch range corresponding to the user's vocal 
sound can be set in the sound signal analyzing device. 
[0027] 

Next, in the main routine, a determination is made as to whether the level setting button 
73A has been operated in the user setting area 73 of the parameter setting screen 70, and with an 
affirmative (YES) determination, a sound-volume threshold value setting process is carried out 
as shown in Fig. 4. In this sound-volume threshold value setting process, the dialog screen of 
Fig. 9 is displayed, and detection is made of a volume level of the vocal sound input via the 
microphone 2C. Then, the color of the level meter area 91 is varied in real time in accordance 
with the detected sound volume level. Displayed position of the pointer 92 indicating a 
maximum sound volume level, i.e., a criterion or reference level, is determined in the following 
manner. Namely, it is ascertained whether or not the currently-detected sound volume level is 
higher than the currently-set reference level. If so, the criterion or reference level, i.e., the 
maximum sound volume level, and the displayed position of the pointer 92 are changed in 
conformity to the currently-detected sound volume level. If, on the other hand, the 
currently-detected sound volume level is lower than the current reference level, it is further 
determined whether the sound volume level has been found to be decreasing consecutively over 
the last n detections; if so (YES), the reference level, i.e., the maximum sound volume level, and 
the displayed position of the pointer 92 are changed in conformity to the currently-detected 
sound volume level. If the currently-detected sound volume level is lower than the current 
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reference level but the sound volume level has not necessarily been decreasing consecutively 
over the last n detections, it is further determined whether the sound volume level has been lower 
than a predetermined "a" value (e.g., 90% of the reference level) consecutively over the last m 
(m<n) detections; if so (YES), the reference level, i.e., the maximum sound volume level, and 
the displayed position of the pointer 92 are changed in conformity to the currently-detected 
sound volume level similarly to the above-mentioned. If, on the other hand, the sound volume 
level has not been lower than the "a" value consecutively over the last m detections, the current 
reference level is maintained. Through such a series of operations, the criterion or reference 
level, i.e., the maximum sound volume level, and the displayed position of the pointer 92 can be 
varied momently. The series of operations is repeated until the confirming (OK) button 95 is 
operated, upon which a sound volume threshold value, for use in pitch detection, key-on event 
detection or the like, is set in accordance with the maximum sound volume level (reference level) 
being displayed on the dialog screen of Fig. 9. 
[0028] 

Next, in the main routine of Fig. 1, a determination is made as to whether any one of the 
buttons 72A to 72G has been operated in the rounding setting region 72 of the parameter setting 
screen 70, and a rounding condition setting process is carried out as exemplarily shown in Fig. 5. 
In this rounding condition setting process, a different operation is executed depending on the 
button operated by the user. Namely, if one of the measure dividing buttons 72D to 72G has 
been operated, it is determined that a specific number of measure divisions has been designated 
by the user, so that a predetermined operation is executed for setting the designated number of 
measure divisions. If, on the other hand, one of the rounding condition designating buttons 72A 
to 72C has been operated, it is determined that a specific scale has been designated, so that a 
predetermined operation is executed for setting the scale (rounding of intervals or distances 
between adjacent notes) corresponding to the operated button. Such a series of operations is 
repeated until the confirming (OK) button 72H is operated. 
[0029] 

Finally, in the main routine of Fig. 1, a determination is made as to whether or not any 
button relating to performance or musical notation (or transcription) (not shown) has been 
operated by the user, and if so, a predetermined process is carried out which corresponds to the 
operated button. For example, if a performance start button has been operated by the user, a 
performance process flag is set up, or if a musical notation (or transcription) process start button 
has been operated, a musical notation process flag is set up. Upon completion of the 
above-described operations related to the parameter setting screen 70 of Fig. 7, the musical 
notation and performance processes are carried out in the instant embodiment. The musical 
notation process, which is carried out in this embodiment for taking the analyzed sound signal 
characteristics down on sheet music or score, is generally similar to that described in Japanese 
Patent Application No. HEI-9-336328 as noted earlier, and therefore its detailed description is 
omitted here for simplicity. The performance process is carried out on the basis of the 
conventionally-known automatic performance technique and its detailed description is also 
omitted here. It should also be appreciated that the musical notation process is performed in 
accordance with the scale rounding condition selected by the user as stated above. 
[0030] 

Fig. 6 is a flow chart illustrating an exemplary operational sequence of the musical 
notation process when the process is carried out in real time simultaneously with input of the 
vocal sound. Namely, while the sound signal analyzing device in the above-mentioned prior 



Japanese patent application is described as analyzing previously-recorded user's vocal sounds, 
the analyzing device according to the preferred embodiment of the present invention is designed 
to execute the musical notation process in real time on the basis of each vocal sound input via the 
microphone. In this musical notation or transcription process, detection is made of a pitch of 
each input vocal sound in real time. Note that various conditions to be applied in detecting the 
sound pitch, etc. have been set previously on the basis of the results of the above-described 
sound pitch range setting process. The thus-detected pitch is then allocated to a predetermined 
scale note in accordance with a user-designated scale rounding condition. Then, a 
determination is made as to whether there is a difference or change between the current allocated 
pitch and the last allocated pitch. With an affirmative (YES) determination, the same 
determination is repeated till arrival at a specific area of a measure corresponding to the 
user-designated measure-dividing condition, i.e., a "grid" point. Upon arrival at such a grid 
point, the last pitch, i.e., the pitch having lasted up to the grid point, is adopted as score data to be 
automatically written onto the music score. If there is no such difference or change between the 
current allocated pitch and the last allocated pitch, i.e., if the same pitch occurs in succession, it 
is adopted as score data to be written onto the score. By carrying out such a series of musical 
notation operations (i.e., operations for taking the analyzed signal characteristics down on the 
score) on the real-time basis, it is possible to create score data from the user's vocal sounds in a 
very simple manner, although the thus-created data are of rather approximate or rough nature. 
[0031] 

[Advantageous Results of the Invention] 

In summary, the present invention arranged in the above-mentioned manner affords the 
superior benefit that various parameters for use in sound signal analysis can be modified or 
varied appropriately depending on the types of the parameters and characteristics of user's vocal 
sounds. 

[Brief Description of the Drawings] 

[Fig. 1] A flow chart of a main routine carried out when a personal computer functions as a 
sound signal analyzing device in accordance with an embodiment of the present invention. 

[Fig. 2] A block diagram illustrating a general hardware setup of the personal computer 
functioning as the sound signal analyzing device; 

[Fig. 3] A flow chart illustrating details of a sound pitch setting process shown in Fig. 1 . 

[Fig. 4] A flow chart illustrating details of a sound-volume threshold value setting process 
shown in Fig. 1. 

[Fig. 5] A flow chart illustrating details of a process for setting a rounding condition etc. 
shown in Fig. 1. 

[Fig. 6] A flow chart showing an exemplary operational sequence of a musical notation 
process of Fig. 1. 

[Fig. 7] A diagram illustrating a parameter setting screen displayed as a result of an 
initialization process of Fig. 1. 

[Figs. 8] Diagrams conceptually explanatory of scale rounding conditions corresponding to 
12-tone scale designation, intermediate scale designation and key scale designation. 

[Fig. 9] A diagram illustrating a dialog screen displayed during the sound-volume threshold 
value setting process of Fig. 1. 

[Fig. 10] A diagram illustrating a dialog screen displayed during the sound pitch range 
setting process of Fig. 1 . 
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[Brief on the Reference Characters] 
21... CPU, 22... ROM, 23... RAM, 24... external storage device, 25... mouse 
operation detecting circuit, 26... mouse, 27... communication interface, 

28... communication network, 29... server computer, 2 A... MIDI interface, 2B... another 
MIDI instrument, 2C... microphone, 2D... microphone interface, 2E... keyboard, 2... 
keyboard operation detecting circuit, 2G... display, 2H... display circuit, 2J... tone 
generator circuit, 2K. . . effect circuit, 2L. . . sound system, 2N. . . timer. 
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[Name of Document] ABSTRACT 
[Abstract] 

[Subject] To enable to modify various parameters for use in the sound signal analysis in 
accordance with types of the parameters and characteristics of a user's vocal sound. 
[Means for Solving] Input means inputs a desired sound signal. Characteristic extraction 
means extracts a characteristic of the sound signal from the sound signal input from said input 
means. Setting means sets various parameters for use in analysis of the sound signal, in 
accordance with the characteristic of the sound signal extracted by said characteristic extraction 
means. Because of the arrangement that a characteristic of the input sound signal is extracted 
via the extraction means, even when the input sound signal variously differs depending on its 
sound characteristic (such as a user's singing ability), various parameters can be appropriately 
altered in accordance with the difference in the extracted characteristic of the sound signal, 
which thereby greatly facilitates setting of the necessary parameters. 
[Representative Figure] Figure 1 



